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ABSTRACT 

In this work, we propose a novel adaptive reduced-rank strategy 
based on joint interpolation, decimation and filtering (JIDF) for large 
multiuser multiple-input multiple-output (MIMO) systems. In this 
scheme, a reduced-rank framework is proposed for linear receive 
processing and multiuser interference suppression according to the 
minimization of the bit error rate (BER) cost function. We present 
a structure with multiple processing branches that performs dimen- 
sionality reduction, where each branch contains a group of j ointly 
optimized interpolation and decimation units, followed by a linear 
receive filter. We then develop stochastic gradient (SG) algorithms to 
compute the parameters of the interpolation and receive filters along 
with a low-complexity decimation technique. Simulation results are 
presented for time- varying environments and show that the proposed 
MBER- JIDF receive processing strategy and algorithms achieve a 
superior performance to existing methods at a reduced complexity. 

Index Terms — Adaptive filtering, minimum-BER, reduced- 
rank techniques, massive MIMO, stochastic gradient algorithms. 

1. INTRODUCTION 

Large MIMO systems have received significant attention in the re- 
cent years since they can substantially increase the system capacity 
and improve the quality and reliability of wireless links 1 1 1. Differ- 
ent configurations have been investigated for large MIMO systems, 
such as distributed and centralized MIMO schemes. Key applica- 
tions of these systems include wireless cellular, local area (2) [3] El 
and multi-beam satellite networks |5 |. The problem of detecting a 
desired user in a large multiuser MIMO system presents many sig- 
nal processing challenges including the need for algorithms with the 
ability to process large-dimensional received data, fast and accurate 
adjustment of parameters, scalable computational complexity and 
the development of cost-effective interference mitigation schemes. 

In this context, reduced-rank signal processing is a key tool for 
large systems which can provide faster training, a better tracking per- 
formance and an increased robustness against interference as com- 
pared to standard methods. A number of reduced-rank techniques 
have been developed to design the dimensionality reduction matrix 
and the reduced-rank receive filter ll6l- III51 . Among the first schemes 
are the eigendecomposition-based (EIG) algorithms |6 |, |7| and the 
multistage Wiener filter (MWF) investigated in |8|-|10|. EIG and 
MWF have faster convergence speed compared to the full rank adap- 
tive algorithms with a much smaller filter size, but their computa- 
tional complexity is high. A strategy based on the joint and iterative 
optimization (JIO) of a subspace projection matrix and a reduced- 
rank filter has been reported in III2lll3|[T4][T5l . 

However, most of the contributions to date are either based on 
the minimization of the mean square error (MSE) and/or the min- 
imum variance criteria [6|-[15 |, which are not the most appropri- 
ate metric from a performance viewpoint in digital communications. 



Design approaches that can minimize the bit error rate (BER) have 
been reported in fl6] [17] [18] [19] [20] and are termed adaptive MBER 
techniques. The work in 1181 appears to be the first approach to com- 
bine a reduced-rank algorithm with the BER criterion. However, the 
scheme is a hybrid between an EIG or an MWF approach, and a 
BER scheme in which only the reduced-rank filter is adjusted in an 
MBER fashion. Moreover, the existing works on MBER techniques 
have not addressed the key problem of performance degradation ex- 
perienced when the filters become larger and their performance con- 
verges gradually to MSE-based techniques. 

In this work, we propose an adaptive reduced-rank linear receive 
processing strategy based on joint interpolation, decimation and fil- 
tering (JIDF) for large multiuser MIMO systems. The proposed 
scheme employs a multiple-branch framework which adaptively per- 
forms dimensionality reduction using a set of jointly optimized in- 
teipolation and decimation units, followed by receive filtering ac- 
cording to the BER cost function. The dimensionality reduction is 
optimized at each time instant by selection of the interpolation filter 
and the decimation pattern with the best performance. After dimen- 
sionality reduction, a linear receive filter with reduced dimension de- 
signed using the BER criterion is applied to suppress the multiuser 
interference and estimate the data symbols. We devise stochastic 
gradient (SG) algorithms to compute the parameters of the interpo- 
lation and receive filters along with a low-complexity decimation 
technique. A unique feature of our scheme is that all component fil- 
ters have a small number of parameters and can take full advantage 
of the MBER adaptation. Simulation results show that the proposed 
MBER- JIDF receive processing strategy and algorithms have a su- 
perior performance to existing techniques at a reduced complexity. 

2. SYSTEM MODEL AND PROBLEM STATEMENT 

Let us consider the uplink of an uncoded synchronous multiuser 
MIMO system with K users and one base station (BS) |21 1-|24|, 
where each user is equipped with Nu antennas and the BS is 
equipped with M uncorrelated receive antennas and KNu < M. 
We assume that the channel is a MIMO time-varying flat fading 
channel. The M-dimensional received vector is given by 

K 

r(i) =^^feHfe(i)bfc(i) + n(j), (1) 

fc=i 

where bfc(j) = [bk.i{i) ■ ■ ■ bk.n{i) ■ ■ ■ &fc,]v„(i)]^ is a Nu x 1 sym- 
bol vector of user k corresponding to the i-th time instant, n — 
1, . . . , Nu, and the amplitude of user k is Ak, k — 1, . . . ,K. The 
M X Nu matrix Hfc(i) is the channel matrix of user k, which is 
given by 

Hfc(j) = [hk,i{i}...hkj{i)...hk,M{i)f, (2) 

where the Nu x 1 channel vectors hk.f{i), for / = 1, . . . ,M, 
consist of independent and identically distributed complex Gaussian 



variables with zero mean and unit variance, n(i) — [ni (i) ■ ■ ■ um («)] where the M x M Toeplitz convolution matrix P; (i) is given by 



is the complex Gaussian noise vector with zero mean and iJ[n(i)n^(i)] = 
(T^I, where is the noise variance, (.)"^ and (.)^ denote transpose 
and Hermitian transpose, respectively. 

In the following, we explain the design of reduced-rank receive 
processing schemes which minimize the BER. In a reduced-rank al- 
gorithm, an M X D subspace projection matrix Sd is applied to 
the received data to extract the most important information of the 
data by performing dimensionality reduction, where 1 < D < M. 
A D X 1 projected received vector is obtained as r(i) = S%r{i), 
where it is the input to a _D x 1 filter w. The filter output is given 
by Xk^ri{i) = w^f (i) — vi^ S%r{i). Assuming that we use binary 
signalling, the estimated symbol of user k is given by bk,n{i) = 
sign{5R[w^r(i)]}, where the operator retains the real part of 
the argument and sign{.} is the signum function. The probability of 
error for user k is given by 



,n 



in{bk,n{i)Wxk,7i{i)] 
p(w»SgSi5w)^ 



(3) 



where Xk,n ~ sign{6fc,„(i)}5R[xfc,„(i)] denotes a random variable, 
/(xk.n) is the single point kernel density estimate ||16il which is 
given by 



f{Xk,n) = 



oxp 



-(xk,„ - sign{bk,n}^[xk,n]y 



(4) 



where p is the radius parameter of the kernel density estimate, Q{.) 
is the Gaussian error function. The problem we are interested in 
solving is how to devise a cost-effective algorithm to adjust the pa- 
rameters of Sd and w based on minimizing the probability of error 
with reduced length component filters. 

3. PROPOSED MBER-JIDF REDUCED-RANK LINEAR 
RECEIVE PROCESSING SCHEME 

In this section, we detail the proposed MBER reduced-rank linear 
receive processing scheme based on joint interpolation, decimation 
and filtering, which comes from two observations. The first is that 
rank reduction can be performed by reconstructing new samples with 
interpolators and eliminating (decimating) samples that are not use- 
ful in the filtering process 11141 . The second comes from the structure 
of the dimensionality reduction matrix, whose columns are a set of 
vectors formed by the interpolators and decimators. 

3.1. Overview of the MBER-JIDF Scheme 

We design the subspace projection matrix Sd by considering in- 
terpolation and decimation. In this case, the receive filter length is 
substantially reduced, which results in significantly reduced compu- 
tational complexity and very fast training for large MIMO systems. 
The proposed MBER-JIDF scheme for the n-th symbol of the fc-th 
user is depicted in Fig. [T] The AI x 1 received vector r(i) is pro- 
cessed by a framework with B branches, where each branch contains 
an interpolator and a decimation unit, followed by a reduced-rank 
receive filter. In the l-th branch, the received vector is operated by 
the interpolator pi{i) = . . . ,pi^i{i)]^ with filter length /, 

/ < M, the output of the interpolator of the l-th branch is expressed 
by 

h{i) = Pf{i)r{{) (5) 
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In order to facilitate the description of the scheme, we introduce an 
alternative way to represent the vector ?; (i). 



n(i)=Pf(i)r(i) = R(j)pr(i) 

where the M x I matrix R (i) with the samples of r(i) 
[ro(i), . . . , rM-i(i)]"^ has a Hankel structure |261 given by 



(6) 



R (i) 



ri{i) 



rM-i(i) TM-i+iii) 

rM-i+i{i) rM-i+2{i) 

rM-2{i) rM~i{i) 
\ rM-i(i) 



ri-i{i) 

rM-i{i) 




The dimensionality reduction is performed by a decimation unit with 
DxM decimation matrices T; that projects ?; (i) onto _D x 1 vectors 
r; (i) with I — 1, . . . , B, where D is the rank. The D x 1 vector f ; (i) 
for the l-th branch is given by 

fi{i) = TiFf{i)r{i)=Tiri{i) = TiK{i)p:{{) (7) 

So,l(i) 

where SD,i{i) denotes the equivalent subspace projection matrix 
corresponding to the l-th branch. The output of the reduced-rank 
receive filter w(i) corresponding to the l-th branch is given by 
Xk n(*) = w^(i)r;(i), which is used in the minimization of the 
error probability for branch I. The hard decision for the l-th branch 
is given by &i,„(j) ~ sign{5ft[w^r;(i)]}. The proposed scheme 
employs B parallel branches of interpolators and decimators. The 
optimum branch is selected according to 



;ign{bfc.„(i)}iR[xL„(i)] 
(8) 

Theoutput of the scheme is given by ^(i) = sign{5R[w^r;„j,t (i)]} 



lovt ~ are; min P; ' , where ' — i 

KKB 



3.2. Design of the Decimation Unit 

In this work, the elements of the decimation matrix only take the 
value or 1. This corresponds to the decimation unit simply keep- 
ing or discarding the samples. The optimal decimation scheme ex- 
haustively explores all possible patterns which select D samples 
out of M samples. In this case, the scheme can be viewed as a 
combinatorial problem and the total number of patterns is i? = 
M{M -1) ...{M - D + 1). 
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Fig. 1. Structure of the proposed MBER-JIDF scheme 



However, the optimal decimation scheme is too complex for 
practical use. We introduce a low-complexity suboptimal method 
to generate the decimation matrix. It employs a structure formed in 
the following way 



where the M x 1 vector ti.d denotes the d-th basis vector of the Z-th 
decimation unit, d — 1, . . . , D, I — 1, . . . , B, and its structure is 
given by 

t,,d = [0^_^,1, 0^_^1^ (9) 

where qi,d is the number of zeros before the nonzero element. Note 
that it is composed of a single 1 and M — 1 Os. We set the value 
of qi^d in a deterministic way which can be expressed as qi^d = 
[■^J X (d — 1) + (Z — 1). The simulation results will show that 
the proposed reduced-rank scheme with the suboptimal decimation 
unit design method works very well. In the following section, we 
will introduce the proposed adaptive algorithms for the interpolator 
filter Pi (i) and the reduced-rank receive filter w(i). 



4. PROPOSED ADAPTIVE ALGORITHMS 

In this section, we develop the MBER based adaptive SG algo- 
rithms to update the interpolator and the reduced-rank filters for 
each branch. We then provide a computational complexity analysis 
of the proposed and conventional adaptive reduced-rank algorithms. 

4.1. Adaptive MBER-JIDF Algorithms 

Firstly, we derive the gradient terms for the reduced-rank filter and 
the interpolation vector By taking the gradient of (|8) with respect to 
w* and after further mathematical manipulations we obtain 



(0 



- exp 



sign{fefe,„(i)} 



9w* 



2V27rp 



ml 



(i)]Sg,,SD,,W 



(10) 



To derive the gradient terms for the interpolator Pi(i), we need to 
express the output of the Z-th branch xj. as a function of 
which is given by 



4,n» = w^(j)Ti(j)R'(i)pr(i) = pf (i)u(i) 



(11) 



where u(i) — R "^(i)Tf (i)w*(i) is an / x 1 vector We let u(i) = 
ui(i), . . . , uiii)]^ and rewrite the error probability cost function 

sign{fefe,„(i)}K[pi,iMi +p2dU2 



as follows 



pP = ^ 



+ Pl,lUl\ 



P\/9{Pl,hP2,l, ■ ■ ■ ,Pl,l) 

where the function g{pi,i,P2,i, ■ ■ ■ ,Pi,i) is given by 



(12) 



g(pi,l,P2,l, ■ ■ ■ ,Pl,l) = WiipijplA + ■■■ +P4-l,lP*4,id)^l 

+ W2{p\,ip*l,l + ■■■ +PH,lPl2,l)^2 + ■ 

+ WDipidpl^i + ■ ■■ + P4>d,ipId,i)^*d 



(13) 



where (j)d denotes the number of nonzero elements for row d in the 
D X I matrix Ti(i)R'(j), 1 < d < D, / = 01 > (^2 > . . . > 
(j)D > 1. Note that g(pi,i,p2,i, ••• = w^S|5 jSd.jw, and 

we define w = [wi, . . . , wd]''" ■ By taking the gradient with respect 
to each element p* j in vector pi{i), j ~ 1, ...,/, we obtain 



dp 



(0 



exp 



2p^9(Pl,( Pi a) 



sign{6fe,„(i)} 



dp 



3,1 



2V27rp 



Uj {i) 



^Pl,l) 



52 (Pl 



,PI,l) 



(14) 



where ijjj denotes the number of nonzero elements for column j in 
the D X J matrix Ti(i)R' (j), 1 < j < /, D = Vi > "02 > • . • > 
> 1. We stack the / elements and obtain an J x 1 gradient 



vector as v; 



The interpolator and the reduced-rank receive filters are jointly 
optimized according to the BER criterion. The algorithm has been 
devised to start its operation in the training (TR) mode, and then 
to switch to the decision-directed (DD) mode. The proposed 
SG algorithms are obtained by substituting the gradient terms 
dlOb and il4l in the expressions w(j + 1) = w(j) — fiw-§^ 
and p;(i + 1) = pi(i) — i-ip'Vi |25| subject to the constraint of 
w^(j)Sg,;(«)SDj(i)w(i) = g(j>i^i,p2,i, . . . ,pi,i) = 1. At each 
time instant, the weights of the two quantities of branch I are updated 
in an alternating way by using the following equations 



exp 



w(i + 1) = w(i) + fi. 



2P 



sign{bfc,„(j)} 



2V27rp 

X (Sg,,(j)r(i) -5R[4,n(i)]Sg,Kj)So.,(i)w(i)) 

(15) 



pi{i + 1) = Pi(j) - X 



dPP dP, 



(0 



dP, 



dpl/ dp*/"" dpl, 
where each element in the gradient vector is given by 



(16) 



dp 



(0 



' exp 



7;^ — 



2p2 



;ign{6fe,„(i)} 



dp* I 2V^P 

X {ujii) ~ ?R.[xi/i)]{\w/ + ... + \w^.f)pjj), 



(17) 



where /i^, and fip are the step-size values. Expressions ( I15t and (I16t 
need initial values, w(0) and Pi(0), and we scale the interpolation 
vector by p; /_ at each iteration. The scaling has 



an equivalent performance to using a constrained optimization with 
Lagrange multipliers although it is computationally simpler. The 
proposed MBER-JIDF algorithm are summarized in Table [T] 



Table 1. Proposed adaptive MBER-JIDF algorithm. 

1 Set step-size values n^, and /i^j-, and the no. of branches B. 

2 Initialize w(0) andpi(O). Set Ti, . . . , Ts. 

3 for each time instant i do 

4 for I from 1 to B do 

5 Update p; (i + 1) based on T; and l ll6t . 

6 Scale the vector p; using p; 



Pi 



6 end 

7 Select the optimal branch based on ([8}. 
Generate the estimated symbol. 

8 Update w(i + 1) based on the selected branch and dlSb 



4.2. Computational Complexity 

In Table |2l we show the number of additions and multiplications of 
the proposed MBER-JIDF algorithm, the existing adaptive reduced- 
rank algorithms, the adaptive least-mean square (LMS) [25] and the 
SG full-rank algorithm based on the BER criterion 1 16 1. In the case 
of large MIMO systems, the parameters D, I and B are chosen much 
smaller than A/, which results in a substantial complexity saving. In 
particular, for a configuration with M = 40, 1 — D = 8 and 5 = 4, 
the numbers of multiplications and additions for the proposed al- 
gorithm are upper bounded by 1825 and 1595, respectively. For 
the MWF-MBER algorithm they are 15594 and 11857, respectively. 
Compared to the existing reduced-rank algorithms, the MBER-JIDF 
algorithm reduces the computational complexity significantly. 

Table 2. Computational complexity of Algorithms. 



Algorithm 



Number of operations per symbol 
Multiplications Additions 



Full-Rank-LMS 
Full-Rank-MBER 

EIG\1\ 
MBER-MWF ||18J 



MBER-JIDF 



2M + 1 
4M + 1 
0{hfi) 
{D + 1)M^ 
+(3D + 1)M 
+M + 3D -I- 10 

MDB + DB 
+71 B + 4_D + 1 



2M 
4M - 1 

{D - 1)M^ 
+{2D - 1)M 

+M + 2D + 1 
MDB + IB 

-B + 4:D-1 



5. SIMULATIONS 

In this section, we evaluate the performance of the proposed 
MBER-JIDF reduced-rank algorithm and compare it with exist- 
ing full-rank and reduced-rank algorithms. Monte-carlo simu- 
lations are conducted to verify the effectiveness of the MBER- 
JIDF adaptive reduced-rank SG algorithms. The number of re- 
ceive antennas at the BS is M = 40. The number of anten- 
nas per user is Nu = 2. The coefficients of the channel ma- 
trix Hfe(i) are computed according to Clarke's model [27 1. We 
have optimized the step sizes of each branch of the MBER-JIDF 



adaptive reduced-rank SG algorithms with the following rules, 

+ 0ifi^/p(i) + d2XQl •■ •■ — 

where [.]^_ denotes the truncation to the limits of a range. We tuned 

5i = 0.99, (52 = 1 X IQ-", = 1 X 10"^ and = 1 x 10"^ 
and set p — 2a (16]. The step sizes for LMS adaptive full-rank, SG 
adaptive MBER full-rank and the other reduced-rank techniques are 
0.085, 0.05 and 0.035, respectively. The initial full-rank, reduced- 
rank and interpolation filters are [1, 0, . . . , 0]^. The algorithms 
process 200 symbols in TR and 1000 symbols in DD. 

Fig|2](a) shows the BER performance of the desired user ver- 
sus the number of received symbols for the proposed MBER-JIDF 
scheme and the conventional full rank and reduced-rank algorithms. 
We set the rank D = S, I = 8, K = A, SNR = 15dB and faT = 
1 X 10^^. We can see that the proposed MBER-JIDF reduced- 
rank algorithms converge much faster than the conventional full rank 
and reduced-rank algorithms. Fig|2](b) illustrates the steady-state 
BER performance of the desired user versus the number of users K. 
We can see that the best performance is achieved by the proposed 
MBER-JIDF algorithms followed by the MWF-MBER algorithm, 
the full-rank MBER algorithm, the full-rank LMS algorithm and the 
eigen-decomposition-based algorithms. In particular, the MBER- 
JIDF algorithm using B — 4 can accommodate up to four more 
users in comparison with the MWF-MBER algorithm 1181 , at the 
BER level of 2 x 10"^ 



M=40, K.4, SNR=15dB 



M=40, SNR.15dB 




200 400 600 800 
(a) number of received symbols (b) number o1 users 

Fig. 2. BER performance versus (a) number of received symbols, 
K= 4, (b) number of users. Parameters: D = I — 8, SNR= 15dB, 
and Nu = 2. 

6. CONCLUSION 

In this paper, we have proposed an adaptive reduced-rank linear re- 
ceive processing scheme and MBER algorithms for interference sup- 
pression in large multiuser MIMO systems. For each branch, we 
designed a group of jointly optimized interpolation and decimation 
units, followed by linear receive filtering according to the minimiza- 
tion of the BER cost function. The final output is switched to the 
branch with the best performance based on the minimum error prob- 
ability. We have developed SG based algorithms for their adaptive 
implementation. The results have shown that the proposed scheme 
significantly outperforms existing algorithms and supports systems 
with higher loads. Future work will consider non-linear detectors, 
higher order modulation and other MIMO configurations. 
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